Investigating the Contribution of Distributional Semantic Information for Dialogue Act Classification

نویسندگان

  • Dmitrijs Milajevs
  • Matthew Purver
چکیده

This paper presents a series of experiments in applying compositional distributional semantic models to dialogue act classification. In contrast to the widely used bag-ofwords approach, we build the meaning of an utterance from its parts by composing the distributional word vectors using vector addition and multiplication. We investigate the contribution of word sequence, dialogue act sequence, and distributional information to the performance, and compare with the current state of the art approaches. Our experiment suggests that that distributional information is useful for dialogue act tagging but that simple models of compositionality fail to capture crucial information from word and utterance sequence; more advanced approaches (e.g. sequenceor grammar-driven, such as categorical, word vector composition) are required.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

Semantic Features for Dialogue Act Recognition

Dialogue act recognition commonly relies on lexical, syntactic, prosodic and/or dialogue history based features. However, few approaches exploit semantic information. The main goal of this paper is thus to propose semantic features and integrate them into a dialogue act recognition task to improve the recognition score. Three different feature computation approaches are proposed, evaluated and ...

متن کامل

The Distribution of Mood An Exploration of Distributional Compositions in Sentiment Classification

Distributional semantics is a research area investigating unsupervised datadriven models for quantifying semantic relatedness. This thesis investigates the possibilities of using distributional semantic models for sentiment classification of utterances, by composing distributional vectors of words in utterances. For evaluation I use a set of manually classified movie reviews. While the purpose ...

متن کامل

Understanding questions and finding answers: semantic relation annotation to compute the Expected Answer Type

The paper presents an annotation scheme for semantic relations developed and used for question classification and answer extraction in an interactive dialogue based quiz game. The information that forms the content of this game is concerned with biographical facts of famous people’s lives and is often available as unstructured texts on internet, e.g. Wikipedia collection. Questions asked as wel...

متن کامل

Automatic Utterance Segmentation in Instant Messaging Dialogue

Instant Messaging (IM) chat sessions are real-time, text-based conversations which can be analyzed using dialogue-act models. Dialogue acts represent the semantic information of an utterance, however, messages must be segmented into utterances before classification can take place. We describe and compare two statistical methods for automatic utterance segmentation and dialogue-act classificatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014